Improving the speaker-dependency of subword-unit-based isolated word recognition

Authors

  • Takuya Koizumi
  • Shuji Taniguchi
  • Kazuhiro Kohtoh
Abstract

This paper deals with a subword-unit-based isolated word recognition system with enhanced speaker independence. A subword is defined as a part of a word whose central portion has fairly stationary (time-invariant) short-time spectra, while the portions near its ends have rapidly varying short-time spectra. In this system, each isolated word is decomposed into a sequence of subwords, each of which is identified by a dedicated semi-continuous hidden Markov model called a subword HMM. Each isolated word is recognized by a particular set of concatenated subword HMMs, designated a word HMM. Subword boundaries within a word are detected by finding peaks in the magnitude of the delta cepstral vectors obtained from the word. The system attains average word recognition rates of over 87% for a number of Japanese words uttered by ten native male speakers.
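
The boundary-detection step described in the abstract can be illustrated with a minimal sketch. The abstract does not give the delta formula, the window length, or the peak-picking criterion, so everything below is an assumption: a standard regression-style delta over a hypothetical `half_window` of 2 frames, the Euclidean norm as the "magnitude", and a simple local-maximum test with an optional `threshold`. The function names are illustrative only and are not taken from the paper.

```python
import math


def delta_cepstra(cepstra, half_window=2):
    """Regression-style delta of a sequence of cepstral vectors.

    `cepstra` is a list of equal-length lists (one vector per frame).
    The regression formula and `half_window` are assumptions, not the
    paper's stated method. Frames past either end are clamped.
    """
    n = len(cepstra)
    dim = len(cepstra[0])
    denom = 2 * sum(k * k for k in range(1, half_window + 1))
    deltas = []
    for t in range(n):
        d = [0.0] * dim
        for k in range(1, half_window + 1):
            nxt = cepstra[min(t + k, n - 1)]  # clamp at the last frame
            prv = cepstra[max(t - k, 0)]      # clamp at the first frame
            for i in range(dim):
                d[i] += k * (nxt[i] - prv[i])
        deltas.append([x / denom for x in d])
    return deltas


def delta_magnitude(deltas):
    """Euclidean norm of each delta vector (assumed meaning of 'magnitude')."""
    return [math.sqrt(sum(x * x for x in d)) for d in deltas]


def find_boundaries(magnitude, threshold=0.0):
    """Frame indices where the magnitude has a local peak above `threshold`.

    A frame counts as a peak if it is strictly higher than its left
    neighbour and not lower than its right neighbour, so a flat-topped
    peak is reported once, at its first frame.
    """
    peaks = []
    for t in range(1, len(magnitude) - 1):
        m = magnitude[t]
        if m > threshold and m > magnitude[t - 1] and m >= magnitude[t + 1]:
            peaks.append(t)
    return peaks
```

For a synthetic input of ten frames of `[0.0, 0.0]` followed by ten frames of `[1.0, 0.0]`, `find_boundaries(delta_magnitude(delta_cepstra(frames)))` returns `[9]`, which marks the spectral change between the two stationary stretches. On real speech the magnitude contour is much noisier, so a non-zero threshold or some smoothing would likely be needed.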

Similar resources

Speech Recognition Using Demi-Syllable Neural Prediction Model

The Neural Prediction Model is the speech recognition model based on pattern prediction by multilayer perceptrons. Its effectiveness was confirmed by the speaker-independent digit recognition experiments. This paper presents an improvement in the model and its application to large vocabulary speech recognition, based on subword units. The improvement involves an introduction of "backward predic...

Incorporating linguistic knowledge and automatic baseform generation in acoustic subword unit based speech recognition

A major challenge in speech recognition based on acoustic subword units is creating a lexicon which is robust to inter- and intra-speaker variations. In this paper we present two different approaches for incorporating simple word-level linguistic knowledge into the labelling step of the training procedure. The proposed systems also utilise a scheme for combined optimisation of baseforms and subwor...

Validating Different Flexible Vocabulary Approaches on the Swiss French PolyPhone and PolyVar databases

In this paper, we attempt to validate the flexible vocabulary approach for speaker independent isolated word and connected words recognition. We compare the performance of classical whole word HMMs against different sets of subword units. For this purpose, we model phonemes, diphones and words of the (Swiss) French language. The recognition rates obtained with phoneme models are monitored as we in...

A Joint Segmentation and Labelling Scheme for use in Acoustic

A major challenge in speech recognition based on acoustic subword units is creating a lexicon which is robust to inter- and intra-speaker variations. In this paper we present a joint segmentation and labelling scheme to incorporate word-level linguistic knowledge into the training procedure. The proposed system is also based on a combined optimisation of the baseforms and the subword models. F...

Isadora: a Speech Modelling Network Based on Hidden Markov Models

In this paper we present the ISADORA system which provides highly flexible speech recognition based on HMM technology together with a hierarchical representation of speech units. Markov model topologies, subword unit inventories, regular grammars expressed in finite-state or phrase structure style, and even the analysis tasks themselves are explicitly represented by the nodes of a large speech uni...

Publication date: 1998